Measuring an Artificial Intelligence System's Performance on a Verbal IQ Test For Young Children
Abstract
We administered the Verbal IQ (VIQ) part of the Wechsler Preschool and Primary Scale of Intelligence (WPPSI-III) to the ConceptNet 4 AI system. The test questions (e.g., “Why do we shake hands?”) were translated into ConceptNet 4 inputs using a combination of the simple natural language processing tools that come with ConceptNet and short Python programs that we wrote. The question answering used a version of ConceptNet based on spectral methods. The ConceptNet system scored a WPPSI-III VIQ that is average for a four-year-old child, but below average for five- to seven-year-olds. Large variations among subtests indicate potential areas of improvement. In particular, results were strongest for the Vocabulary and Similarities subtests, intermediate for the Information subtest, and lowest for the Comprehension and Word Reasoning subtests. Comprehension is the subtest most strongly associated with common sense. The large variations among subtests and ordinary common sense strongly suggest that the WPPSI-III VIQ results do not show that “ConceptNet has the verbal abilities of a four-year-old.” Rather, children’s IQ tests offer one objective metric for the evaluation and comparison of AI systems. This work also continues previous research on Psychometric AI.
Similar resources
On the Relationship between General Factor and Foreign Language Proficiency
Raven Progressive Matrices (RPMs) and the Wechsler Intelligence Scale for Children (WISC-R) are two common general intelligence scales used in Iranian high schools. In this paper, the relationships between g factor scales and students’ reading comprehension, grammar, and vocabulary were examined by correlation and regression analysis. Standard tests of grammar and vocabulary and Cambridge K...
Investigating the Cognitive Functions of Students with Stuttering
Objective Stuttering is one of the most common speech disorders and causes many complications in children and adults. This disorder involves behavioral, cognitive and emotional interactions. The purpose of the current study is therefore to investigate the cognitive functions of students with stuttering. Materials & Methods A descriptive study comprising 30 students (8 females and 22 males) fr...
Children with unilateral hearing loss may have lower intelligence quotient scores: A meta-analysis.
OBJECTIVES/HYPOTHESIS In this meta-analysis, we reviewed observational studies investigating differences in intelligence quotient (IQ) scores of children with unilateral hearing loss compared to children with normal hearing. DATA SOURCES PubMed Medline, Cumulative Index to Nursing and Allied Health Literature, Embase, PsycINFO. METHODS A query identified all English-language studies related...
WISC-R verbal and performance IQ discrepancy in an unselected cohort: clinical significance and longitudinal stability.
This study examined children from an unselected birth cohort who had Wechsler Intelligence Scale for Children-Revised (WISC-R) verbal and performance IQ discrepancies that placed them beyond the 90th percentile. It was hypothesized that, relative to their cohort peers, these children would be characterized by greater frequency of perinatal difficulties, early childhood neurological abnormalitie...
Executive dysfunction screening and intelectual coefficient measurement in children with attention deficit hyperactivity disorder.
OBJECTIVE To perform a complete intelligence quotient (IQ) measurement (verbal, performance, and total) and, subsequently, to compare executive function (EF) measurements in subgroups of children with attention deficit-hyperactivity disorder (ADHD) with a control group. METHOD We studied a group of children 7-12 years of age from public elementary schools. Children were selected by means ...
Journal title:
- J. Exp. Theor. Artif. Intell.
Volume 29, Issue
Pages -
Publication date 2017